feat: intial hudi reg test by rahil-c · Pull Request #3641 · apache/polaris

rahil-c · 2026-02-02T05:13:53Z

Summary

Adding intial regression test for polaris hudi integration, following exist pattern set by Delta regression test
Made changes in run.sh and setup.sh in order to ensure that spark session can be started correctly depending on the table format.
Ran locally both delta regression test and hudi regression test to ensure they pass.

Checklist

🛡️ Don't disclose security issues! (contact security@apache.org)
🔗 Clearly explained why the changes are needed, or linked related issues: Fixes #
🧪 Added/updated tests with good coverage, or manually tested (and explained how)
💡 Added comments for complex logic
🧾 Updated CHANGELOG.md (if needed)
📚 Updated documentation in site/content/in-dev/unreleased (if needed)

dimas-b

Thanks for your contribution, @rahil-c ! I'm juts wondering if using shell might be an overkill for this test. Specific comment thread below.

dimas-b · 2026-02-02T17:18:22Z

plugins/spark/v3.5/regtests/spark_hudi.sh

+  "http://${POLARIS_HOST:-localhost}:8181/api/catalog/v1/config?warehouse=${CATALOG_NAME}"
+echo
+echo "Catalog created"
+cat << EOF | ${SPARK_HOME}/bin/spark-sql -S \


Would it be possible to run this test as an Integration test under JUnit inside the Gradle builld?

Hi @dimas-b we currently have the following integration test for polaris hudi here: #3194

In terms of the reg test, I was the following the shell pattern that @gh-yzou had done for Delta, now for Hudi.

Hi @dimas-b The purpose of this regression test is to validate the end-to-end user experience when using Spark with both --packages and --jars. This is an important scenario that cannot be fully covered by integration tests.
While it is true that this test is relatively expensive, that is why it includes only very basic test cases. More complex scenarios and edge cases are covered by integration tests, which provide a more cost-effective approach.

I believe Spark-based scenarios can be covered in regular CI with sufficient certainty... but I do not mean to block this PR on this point 🙂 Please consider optional.

gh-yzou · 2026-02-09T05:44:24Z

plugins/spark/v3.5/regtests/run.sh

+# Define test suites to run
+# Each suite specifies: test_file:table_format:test_shortname
+declare -a TEST_SUITES=(
+  "spark_sql.sh:delta:spark_sql"


How about let's enforce the test file name to the format like xxx_<table_format>.sh, and have a separate folder to include all test src file and reference file. Then we just need to list the folder to get all test files, and extract the table format by parsing the file name. The benefit would be easy to onboard new tests, and developer doesn't have to input a long string when running single test (just the file name)

gh-yzou · 2026-02-09T05:45:20Z

plugins/spark/v3.5/regtests/setup.sh

 # this is mostly useful for building the Docker image with all needed dependencies
-${SPARK_HOME}/bin/spark-sql -e "SELECT 1"
+if [[ "$TABLE_FORMAT" == "hudi" ]]; then
+  # For Hudi: Pass --packages on command line to match official Hudi docs approach


i don't think we need the if else here anymore

gh-yzou · 2026-02-09T05:45:27Z

plugins/spark/v3.5/regtests/spark_hudi.sh

+rm -rf /tmp/spark_hudi_catalog/
+
+curl -i -X DELETE -H "Authorization: Bearer ${SPARK_BEARER_TOKEN}" -H 'Accept: application/json' -H 'Content-Type: application/json' \
+  http://${POLARIS_HOST:-localhost}:8181/api/management/v1/catalogs/${CATALOG_NAME} > /dev/stderr


gh-yzou · 2026-02-17T03:08:09Z

plugins/spark/v3.5/regtests/run.sh

+  exit 1
+fi
+
+parse_test_suite() {


can we add a comment here about what this function is doing, it is trying to extract the TABLE_ROMAT, TEST_SHORTNAME and the full path of TEST_FILE, right?

gh-yzou · 2026-02-17T03:10:35Z

plugins/spark/v3.5/regtests/run.sh

+  exit 1
+fi
+
+# Allow running specific test via environment variable


I think we can potentially also allow running all suites for a particular format by taking table format as an argument to this script. We can probably do that in a separate PR as an improvement.

lets do in another pr

feat: intial hudi reg test

b7e7556

github-project-automation bot added this to Basic Kanban Board Feb 2, 2026

github-project-automation bot moved this to PRs In Progress in Basic Kanban Board Feb 2, 2026

dimas-b reviewed Feb 2, 2026

View reviewed changes

rahil-c added 2 commits February 8, 2026 17:34

try patch

d3330cd

remove comments

a72471e

gh-yzou reviewed Feb 9, 2026

View reviewed changes

rahil-c added 2 commits February 15, 2026 11:19

address yun comment

8cc8e7a

address minor comment

fdf5c9c

rahil-c requested a review from gh-yzou February 15, 2026 20:51

gh-yzou previously approved these changes Feb 17, 2026

View reviewed changes

github-project-automation bot moved this from PRs In Progress to Ready to merge in Basic Kanban Board Feb 17, 2026

minor comment

12b191c

rahil-c dismissed gh-yzou’s stale review via 12b191c February 17, 2026 20:11

rahil-c requested a review from gh-yzou February 17, 2026 20:11

gh-yzou approved these changes Feb 17, 2026

View reviewed changes

flyrain approved these changes Feb 18, 2026

View reviewed changes

flyrain merged commit 893722c into apache:main Feb 18, 2026
15 checks passed

github-project-automation bot moved this from Ready to merge to Done in Basic Kanban Board Feb 18, 2026

Comments

Conversation

rahil-c commented Feb 2, 2026

Summary

Checklist

Uh oh!

dimas-b left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rahil-c Feb 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

rahil-c Feb 9, 2026 •

edited

Loading